Automatic Video Editing for Multimodal Meetings
نویسندگان
چکیده
Meeting recording is being performed through microphones and video cameras in order to keep a permanent record of the events that are happening during the meetings. The technology to perform such recording is already mature and recording is being performed already for some time. However, efficient retrieval of the information from meeting data remains a hot topic of contemporary research. Several approaches to information retrieval exist, such as indexing the data, event semantics analysis of the data, etc. This contribution focuses on automatic video editing of the data in order to prepare audiovisual material, based on several audio and video sources, that is suitable for human users to see. The video editing takes the original audio and video data as its input as well as the results of analysis of audio and video streams and user instructions. The output of the method is a simple audiovisual stream.
منابع مشابه
Automatic analysis of multiparty meetings
This paper is about the recognition and interpretation of multiparty meetings captured as audio, video and other signals. This is a challenging task since the meetings consist of spontaneous and conversational interactions between a number of participants: it is a multimodal, multiparty, multistream problem. We discuss the capture and annotation of the AMI meeting corpus, the development of a m...
متن کاملAutomated Video Editing for Meeting Scenarios Applying Multimodal Low Level Feature Fusion
Most of the employees dislike business meetings because of the effort, the duration and the low efficiency. The AMIDAproject [1] attempts to increase the efficiency by the use of modern machine-learning techniques. One of the ideas of AMIDA is that a camera selection could be performed in smart-meeting rooms, which are equipped with several cameras, so that the most relevant information is show...
متن کاملInvited Talk: Recognition and Understanding of Meetings
This paper is about interpreting human communication in meetings using audio, video and other signals. Automatic meeting recognition and understanding is extremely challenging, since communication in a meeting is spontaneous and conversational, and involves multiple speakers and multiple modalities. This leads to a number of significant research problems in signal processing, in speech recognit...
متن کاملRecognition and Understanding of Meetings
This paper is about interpreting human communication in meetings using audio, video and other signals. Automatic meeting recognition and understanding is extremely challenging, since communication in a meeting is spontaneous and conversational, and involves multiple speakers and multiple modalities. This leads to a number of significant research problems in signal processing, in speech recognit...
متن کاملAudio-Visual Processing in Meetings: Seven Questions and Some AMI Answers
The project Augmented Multi-party Interaction (AMI) is concerned with the development of meeting browsers and remote meeting assistants for instrumented meeting rooms – and the required component technologies R&D themes: group dynamics, audio, visual, and multimodal processing, content abstraction, and human-computer interaction. The audio-visual processing workpackage within AMI addresses the ...
متن کامل